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DNA Base Sequence Homology between CobPhag« 
17 and m and between T3 and #n as deternnned by 
Heteroduplex Mapping in the Electron Microscope 

iBeoeioed 15 Janvary 1973. and in revised form 19 March 1973) 

1- u«««o ^7 and 611 was investigated 
The genetic relatedness of the '^^^ ^fjlfi^^^^^ formed ^vith one 

^eUon mioroscopco^^^^^^^^^ from ^11. 

DNA strand from cohphage T7 ^ . . denaturing conditions, below 

?he T7-^II heteroduplex « atooJm^^^ ^^J^^ 
the melting temperature of T7 DNAjms ^^^^^ complete 

of T7 and ^11 have regions ^f/'^^^f ^j^^^^^^ 

non-homology. In direct contrast the T3^^^^^^^ ^^^^^^ ^ double- 

ing conditions. The fraction of the ^3^11 ^ew P ^^^^ ^.iti^g 

• Sb~hS rrTatirnTl^"- BKAs of T3 and ,11 have 
; extensive sequences of partial homology. 

i 1. Introduction 

Ba^ . Hyman (1971) observed ^-^^ ^^^^^^^^^^^ 

duplex appearing as doable-stranded d^^^^^^^ ^^^^^^^^ 

conditions, below ^nsT^ete^Se-ppeared ir^variant with denatur- 

observed in contrast that the ^"^^^ hetero^^^^ ^^^^ ^^^^ ^wo phages 

ation conditions, over the same range. The hetero^^^ is composed of stretches of 

capable of extensive P^^^^f ^V"" anHS^hes o^^^^^ non-homology. The 
complete base sequence ^-^'''"^'^'^^^^^^^^^ extensive productive 

• heteroduplex derived from .oj^^^ .^SnTed 'stretches of partial homology, 
recombination, .^f ' Xrobse^^^^^^ 

random base changes. ^^nfipotion of the hypothesis, we noted that T7 

Seeking further experunental ^'^^Pj^''^'^^^^ " ^^j. We predicted, observed, 

and its relative ^11 do -d-^-^— ^^^^^T 
and report here that the TT^I^heWup^^ 

::^:r:s;ht::::^'-t^^^^^^ ^'''°^'^^- 
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2. Materials and Methods 



The T7 Btrain used here was from Dr F. W. Studier, who had obtained it at Caltech 
from Dr R. L. Sinsheimer. The T7 strain used in the previous study (Davis & Hyraan^ 
1971) was from the Caltech stocks of the late Dr J. J. Weigle. Both these strains are 
derived from the collection of Dr M. Delbruck. The heteroduplex composed of one strand 
of the Weigle stock and the other of the Studier stock appeared completely duplex when 
observed in the electron microscope (M. N. Simon, personal commimication). The Studier 
stock has been used because of its extensive genetic characterization (Studier, 1969). 
Strain T3 was from the American Type Culture Collection, and is the same straiu as 
used in the previous study (Davis & Hyman, 1971). Strain ^11 was from the collection of 
Dr M. Malamy, who obtained it from Dr K. B. Low, who received it from Dr E. Adelberg, 
who had obtained it at the Pasteur Institute in about 1959 (Linial & Malamy, 1970). 

The procedures for separatiag strands of the various DNAs and for forming hetero- 
duplexes (in this case, containiag the T7 Z-strand and the ^11 r-strand or the T3 r-strand 
and the ^11 Z-strand) have been described previously (Davis & Hyman, 1971). The basic 
protein film technique was described by Davis et al, (1971). The T7-^II and the TS-^EE 
heteroduplexes were mounted xmder four different isodenaturing formamide concentra- 
tions. The isodenaturing procedure has been described in detail before (Davis & Hyman, 
1971). 

3. Results 

A sample electron micrograph of the heteroduplex (at iTjn— 28°C) is shown 

in Plate I. The T7-<^II heteroduplex is almost entirely duplex, though a deletion loop 
close to one end followed shortly by two characteristic non-homology loops is seen 
(Plate I, Fig, 1). There are four to six small non-homology loops on each molecule. 
The two single strands of these loops appear about equal in length. 

The deletion loop of the T7 — ^11 heteroduplex can be used to position the hetero- 
duplex left to right on the T7 physical map. The T7-<^II heteroduplex is compared to 
the heteroduplex composed of ^11 and the DNA deletion mutant of T7, called T7H1 
(Studier, 1972). T7H1 harbors a DNA deletion from 2-9% to 7-0% from the left end 
(M. N. Simon & F. W. Studier, personal communication). If the deletion loop of the 
T7-^II heteroduplex is near the right end, then the T7H1-^II heteroduplex will 
show deletion loops near both ends. If the T7-^II deletion loop is near the left end, 
then the T7Hl-^n heteroduplex will be duplex on the right end and changed on the 
left end. When the T7H1-^II heteroduplex was examined, no deletion loop was seen; 
both ends were duplex. The two characteristic non-homology loops which were 
previously observed near the T7-^II deletion loop were still present. Therefore, the. 
T7-^II deletion loop is on the left end. It can be concluded that this loop is formed 
because of a deletion in <^II DNA relative to T7 DNA and that this deletion covers 
the region equivalent, within experimental error, to the HI deletion of T7 DNA. 

Confirmation of this assignment of the left end of the T7-^n heteroduplex comes 
from a comparison of the early RNAs of T7, T7H1 and ^11. The early region of T7 
extends from 1% to 20% from the left end (Hyman, 1971 ; Studier, 1972). The RNAs.' ■ 
transcribed from this region by the Escherichia coli RNA polymerase form charac- ^ 
teristic bands upon electrophoresis in polyacrylamide gels (Siegel & Summers, j; 
1970). It is predicted that the pattern for <f>ll early RNA will match that of T7H1 1 
early RNA rather than the pattern of T7*^ early RNA. This prediction has heenj 
verified experimentally (Summers et al, 1973; Brunovskis et at,, 1973). iM 

The T7-^II heteroduplex maps are presented in Figure 1. Most of the features arej^ 
invariant with formamide concentration : the deletion in ^11 DNA extending froia^ 
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Fig. 1. T7-^n heteroduplex maps. 
The T7-<^II heteroduplex maps were construct>ed from measxirements on heteroduplexes 
mounted tmder four different isodenaturing formamide concentrations. At each concentration, 
ten heteroduplex molecules were measured. For every molecule, ten molecules each of single- 
stranded and double-stranded <^X174 DNA were measured in the same micrograph as length 
standards. Lengths of the single-stranded regions of the heteroduplex were first expressed as a 
function of single -stranded (^X174 DNA, and duplex regions as a function of double-stranded 
^X174 DNA. The lengths were then converted relative to the length of T7 DNA (or T3 DNA, 
which is identical withiu experimental error) using the relationship that the ratio of the molecular 
lengths of ^X174 DNA to T7 DNA (or T3 DNA) is 0136 (Davis & Hyman, 1971). To construct 
the map of one heteroduplex molecule, its parts were added sequentially from one end. Solid 
thick regions represent duplex. Two thin opposing lines represent two single strands. Regions 
were occasionally found to be duplex in one heteroduplex but single -stranded in another, on the 
same grid. These regions are represented by two opposing lines close together. The fraction of 
the ten molecules observed to be single -stranded in a given region is directly proportional to the 
distance between the two opposing Unes representing the region. The distance apart for observing 
all ten heteroduplexes as single-stranded in that region is seen in (b) from 20% to 22%. The dele- 
tion in DNA is represented by one thin line and by the symbol ddta [A) at the appropriate 
position. The lengths of the single strands of a non-homology loop were averaged. The mounting 
conditions were as follows: (a) -39°C, 30% formamide onto 6% formamide; (b) T^^, -28°C, 
45% formamide onto 15% formamide; (c) -17°C, 60% formamide onto 30% formamide; 
(d) — 6°C, 75% formamide onto 45% formamide. The physical map of T7 at the top of the 
Figure was constructed as described by Davis & Hyman (1971) with the addition of the iiiforma- 
tion provided by Studier (1972), Hyman & Sxmimers (1972), Summers et al. (1973), and Simon & 
Studier (personal communication). Pe is the early promoter region (Davis & Hyman, 1970); Te is 
the site of strong termination at the right end of the early region (Hyman, 1971; Studier, 1972). 
The spaces between genes are for ease of visualization. 
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about 3% to 7% (per cent of T7 DNA measured from the left end) ; the non-homology 
loops between 20% and 21-5%, between 24-5% and 27% (though there is a slight 
tendency for part of that region to form base pairs at SQ'^C), between 41% and 
42%, and between 50% and 51%, the long stretch of duplex between 61% and 89%. 
The few changes in the heteroduplex as a function of formamide include the region 
between 90% and 91%, duplex at T^—39°0, but smgle-stranded at iTj^— 28°C. At 
Tjn— 6*^0, there is an additional melting between 89% and 90%. The regions between 
14% and 19%, between 21-5% and 23%, and between 30% and 34% partiaUy 
melt at T^-e^'G. 
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Fig. 2. T3-^II heteroduplex maps. 
The T3-^II heteroduplex maps were constructed from measurements on heteroduplexes 
mounted under four different isodenaturing formamide concentrations as described in the legend 
to Fig. 1. For the T3-<^II heteroduplex molecules, variation arises from two sources : random error 
and partial melting. Regions that correctly appear duplex on one molecule and single-stranded 
on another are sometimes difficult to recognize. The method used to minimize the random error 
is to locate and align the invariant regions of the T3-^II heteroduplex and then to normalize 
the regions in between. The distance apart for observing all ten heteroduplexes 6is single-stranded 
in that region is seen in (d). The deletion in <t>ll DNA is represented by showing the appropriate 
non-homology loop as being composed of single strands of different length. The lengths of the single 
DNA strands composing the other non-homology loops have been averaged. There were occasion- 
ally features too small to represent graphically, particularly in a region moimted near its Tm* 
as 65% to 70% in (b). 

A sample electron micrograph of the T3-^II heteroduplex (at T^j^— 39°C) is shown 
in Plate II. The DNA deletion in ^11 was used to identify the left end of the TS-^D 
heteroduplexes. The T3-^II heteroduplex maps are presented in Figure 2. In direct 
contrast to the homology between T7 and ^11 DNAs, the extent of duplex in the 
T3-^II heteroduplex decreases with increasing formamide concentration. There is an 
extended stretch of strong homology, double-stranded at T^— 6°C, near the right 
end of the heteroduplex. The terminally redundant ends are sufficiently non-homo- 
logous to remain single-stranded at T^— 39°C. 

Prom the data of Figures 1 and 2, the overall percentage of duplex DNA versus 
formamide concentration can be plotted for both heteroduplexes, as well as T7 DNA ' 
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Plate I. T7-^II heteroduplex at -28°C. 
The T7-^II heteroduplex was mounted from a 45% formamide solution (cation concentration 
0-060 m) and layered onto a 15% formamide solution (cation concentration 0 0060 m). The tem- 
perature of mounting was 23°C. The left end of the molecule, near the deletion loop, is in the upper 
portion of the micrograph. The small circular DNAs are ^X174. The grid was shadowed with 
PtnPd (80% to 20%). 
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Plate II. T3-^II hefceroduplex at — 39°C. 
The T3-^II heteroduplex was mounted from a 30% formamide solution (cation concentration 
0*060 m) and layered onto a 5% formamide solution (cation concentration 0*0060 m). The tem- 
perature of mounting was 23°C. The left end of the molecule, near the non-homology loop composed 
of single strands of vmequal lengths, is in the upper portion of the micrograph. The small circular 
DNAs are ^X174. In this micrograph, the difference between the smooth, even duplex ^X174 
DNA and the uneven, "kinky" single -stranded ^X174 DNA is particularly clear. The grid was 
shadowed with Pt^Pd (80% to 20%). 
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Fig. 3. Melting curves. 

' . The percentage of T7 DNA. the T7-^n heteroduplex and the T3-^II heteroduplex appearing 
as duplex at each iso-denat\iring formamide concentration is plotted against that formamide 
* concentration. The data for the T7-^II heteroduplex are from Fig. 1 and for the T3-^II hetero- 
duplex from Fig. 2. The data for T7 DNA are from Davis & Hyman (1971). Included as duplex 
are those regions appearing duplex in some heteroduplexes and single -stranded in others on the 
game grid, proportional to the fraction appearing as duplex. — O — O — » T7 DNA; — • — • — , 
T7-^n heteroduplex; — A— A— T3-^II heteroduplex. 
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Fig. 4. TS-^II sequence homology map. 
The T3-^II heteroduplex maps of Fig. 2 were arbitrarily divided into 1% intervals. A melting 
curve was constructed for each 1% interval, and the formamide melting concentration was deter- 
mined. The lowering of the melting temperature relative to T7 DNA was calculated using the 
relationship that 1% formamide lowers by 0-7 deg. C (McConaughy et al., 1969) and is the 
left vertical co-ordinate. The actual percentage DNA base sequence homology was calculated 
for each 1% interval of the map using the relationship that 1 deg. C lowering of corresponds 
to 1-6% base sequence mismatch in poly(dA-dT) (Laird eiaZ,, 1969) and is the right vertical co- 
ordinate. The deletion in ^11 DNA is represented at the appropriate position by the symbol 
. ddta (J). The construction of the physical map of T7 is described in the legend to Fig. 1, and is 
^ used here by analogy. Pe, early promoter region; Te, site of strong termination at the right end 
li of the early region. 
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as a control. Figure 3 presents the melting curves. The fraction of duplex of the 
T7-^n heteroduplex is 88% duplex, with only a slight increase at 39°C, and a 
small decrease at T^— 6''C. Ldnial et al. (1970) have previously shown that T7 RNA 
hybridizes efficiently with ^11 DNA and vice versa. The amount of duplex of the 
T3-^II heteroduplex decreases continually with increasing formamide concentration; 
72% duplex at T^^— 39°C to 35% duplex at T^—^'C, This melting at formamide 
concentrations where T7 DNA appears completely duplex defines regions of partial 
homology. 

The melting curve for the T3-^II heteroduplex (Fig. 3) gives the over-all average 
percentage of duplex. However, from the data of Figure 2, the melting of each 1% 
length interval (arbitrarily chosen) of the heteroduplex can be considered separately 
to obtain a detailed map of per cent homology between T3 and ^11 (Fig. 4). 



4. Discussion 

While DNA is generally homologous to T7 DNA, in the regions between 41% 
and 42% and between 90% and 91 % (gene 17, a tail protein), <^II DNA is homologous 
to T3 DNA (Figs 1 and 4). The assignment of gene to function for T7 is from Studier 
(1972). In the region between 24-5% and 27% which includes part of gene 3 (endo- 
nuclease), gene 5.5(lysozyme) and part of gene 4 (involved in DNA synthesis), ^11 
DNA is not homologous to T7 DNA, but is about 68% homologous to T3 DNA. In 
the region between 60% and 51% (gene 10, the major head protein), ^11 is not homo- 
logons to T7 and is not more than 43% homologous to T3. In the region between 
20-6% and 22% (gene i.7, a dispensable gene of unknown function), <^II is neither 
homologous to T7 nor to T3. The deletion in ^11 DNA occurs in the region equivalent 
to gene 0.7 in T7 DNA, T7 gene 0.7 codes for a protein involved in the rapid shut-off 
of host transcription (Brunovskis & Summers, 1972). Linial & Malamy (1970) earlier 
demonstrated that <^II infection did not result in rapid host shut-oflF. The explanation 
is now obvious: ^11 DNA is deleted in that gene. 

The strongest DNA homologies among T7, T3 and ^11 are between 70% and 89%. 
This region includes genes 15 and 16 (both minor head components) and gene 17 (a tail 
protein). Apparently, alterations in the amino acid sequences of these proteins are 
not weU tolerated. 

T7 DNA, T3 DNA, and ^11 DNA have terminal redimdancies extending about 
0-7% (Ritchie et ah, 1967). The terminal redundancy of ^11 is homologous to that of ' 
T7, but is not homologous to that of T3. The function of the terminal redvmdancy is 
not known, though the recent suggestion of Watson (1972) that it plays an essential 
role in DNA repHcation is attractive. If the terminal redimdancy of the DNA is 
necessary for the sm-vival of the phage, and if two phages have non-homologous 
terminal redundancies, as do T3 and c^II or T7 and T3, the DNA products of a single, 
recombination would not be terminally redundant and, therefore, presumably would- 
not be viable. Evolutionary pressure would select for phages that have evolved an 
exclusion mechanism to prevent the production of lethal recombinants after simulta- - 
neous infection. T7 and T3 are known to exclude one another (Hausmann et oi^ 
1961). An exclusion mechanism assures that phage with non-homologous terminal 
redundancies are not capable of extensive productive recombination. Their DNAs 
will accumulate random base sequence changes. 



On the other hand 
regions of sufficient \ 
adopted for all the ] 
those regions. In regie 

An additional possi 
that one phage DNA 
attractive for ly^ogen 
degrade the host DNi 

These results suppo 
forward by Simon et i 
evolutionary patterns 
mode of replication of 

We thank Mrs Joan . 
tron micrographs. Drs '. 
data before pubhcation 
grant CA06619, and th 



Brunovskis, I. & Sumir 
Brunovskis, I., Hyman 
Davis, R. W. & Hymai 
Davis, R. W. & Hymai 
Davis, R. W., Simon, 'M 
L. & Moldave, K., 
Hausmann, R. L., Mag 

9, 511-526. 
^ Hyman, R. W. (1971).. 
: Hyman, R. W. & Smnr 
. Laird, CD., McConaug 
Linial, M. (1971). Ph.D 
' Linial, M. & Malamy, ]\ 
\ Linial, M., Malamy, M. 
KMcConaughy, B. L., La 
r Ritchie, D. A., Thomas, 
23, 365-376. 
1, R. B. & Sunomei 
{ Simon, M. N.; Davis, R. 

A. D., ed.), pp. 313 
Studier, F. W. (1969). T 
Studier. F. W. (1972). 6 
Summers, W. C, Bruno 
Watson, J. D, (1972). A 

Note added in proof. I 
by 0'7^C. Recently. 
**he by 0-6°C. Wet 
absence of formamide t< 
measured the Tj^ of T7 
0*3 M and 31-2°C in 80 
presented by Frank-Kar 
°an be used to calculate 
^ ^^a+ concentration of 



JUMMERS 

fcion of duplex of thel 
eat T„-39°C, and a) 
shown that T7 RN^ 
unt of duplex of the 
'.amide concentration- 
nelting at formamide 
nes regions of partial 

3 the over-all average 
5 melting of each 1% 
considered separately 
^11 (Fig. 4). 



regions between 41%:, 
C DNA is homologous'^ 
or T7 is from Studier;| 
)art of gene 3 (endo.<" 
DNA synthesis), ^11^ 
gous to T3 DNA. Iii | 
5in), is not homo- 
. the region between | 
3tion), ^11 is neither ' 
>he region equivalent 
in the rapid shut-off ' 
[alamy (1970) earlier 
off. The explanation 

n^een 70% and 89%. 
bs) and gene 17 (a tail 
f these proteins are 

ies extending about 
»mologous to that of 
minal redundancy is 
it plays an essential 
ncy of the DNA is 
.ve non-homologous 
products of a single 
, presumably would 
at have evolved an 
nants after simulta- 
' (Hausmann et al., 
Dmologous terminal 
ation. Their DNAs 



T7-^II AND T3-<^II HETERODUPLEX STUDIES 195 

fOn the other hand, for phages such as T7 and ^11, recombination will occur in 
Egions of sufficient DNA homology and the "best" sequence for a gene will be 
feopted for all the phage. This process will maintain complete homology within 
Rhose regions. In regions of insufficient homology, recombination will not occur. 

i An additional possibihty for the evolution of a region of complete non-homology is 
l^at one phage DNA has picked up a section of host DNA. This possibility is more 
Attractive for lysogenic phages such as A than for virulent phages such as T7, which 
tjegrade the host DNA after infection. 

if ' These results support the interpretation of evolution and subsequent selection put 
Rorward by Sunon et al, (1971) and Davis & Hyman (1971), and in addition suggest 
Evolutionary patterns for T7, T3 and ^11 which may be in part a consequence of the 
^ode of replication of these phages. 

m We thank Mrs Joan Nilea and Mr Clifford Scheiner for their measurements of the elec- 
ftron micrographs. Drs F. W. Studier and M. N. Simon generously provided us with their 
Idflta before pubUcation. This work was supported by United States Public Health Services 
fcrant CA06519, and the American Cancer Society grant VC60A. 
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Note added in proof, McConaughy et al, (1969) found that 1% formamide reduced the 
^^m by 0'7*'C. Recently, Bluthmann et al. (1973) reported that 1% formamide reduced 
the by 0-6°C. Wetmur & Davidson (1968) measured the Tj^ of T7 DNA in the 
absence of formamide to be 98°C at a Na+ concentration of 1-0 m. Record & Zinam (1972) 
measured the T^^ of T7 DNA to be 33'8°C in 89% formamide at a Na+ concentration of 
0-3 M and 31'2°C in 80% formamide at a Na+ concentration of 0-08 M. The equation 
presented by Frank-Kamenetskii (1971) relating to base composition and ionic strength 
can be used to calculate that in the absence of formamide the of T7 DNA is 99*'C at 
a Na+ concentration of 1-0 M, 92°C at a Na+ concentration of 0-3 m and 83°C at a Na+ 
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concentration of 0-08 M. The agreement between the experimental and calculated values 
for the Na* concentration of 1-0 m adds credence to the other two values. Calculations 
based on these latter data yield the relationship that 1% formamide lowers the of 
T7 DNA by 0-65°C, a number between the values reported by McConaughy et al, (1969) 
and by Bluthmann et ah (1973). Thus, all the data are in reasonable agreement. 

Laird et al, (1969) found that the was reduced by 0-7*C for each per cent mis- 
match in acid deaminated poly [d(A-T)], a purine-pyrimidine mismatch. Recently, 
TJllman & McCarthy (1973) found the was reduced by 2*2*C for each per cent 
mismatch in alkaline deaminated DNA, another type of pxirine-pyrimidine mismatch. 
Hutton & Wetmur (1973) reported that the reduced by 1-1°C for each per 

cent mismatch in glyoxylated DNA, which is a model for purine-purine mismatch. From 
these numbers, it seems reasonable at present to take as an "average" value that the 
is reduced by 1*4*'C for each per cent mismatch. From this correlation, a region that 
appears double -stranded at — 6°C has less than one base pair mismatched in 25. A 
region that appears single-stranded at Tj^ — 39°C has at least one base pair mismatched 
in three. 
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